67 research outputs found

    Unconventional machine learning of genome-wide human cancer data

    Full text link
    Recent advances in high-throughput genomic technologies coupled with exponential increases in computer processing and memory have allowed us to interrogate the complex aberrant molecular underpinnings of human disease from a genome-wide perspective. While the deluge of genomic information is expected to increase, a bottleneck in conventional high-performance computing is rapidly approaching. Inspired in part by recent advances in physical quantum processors, we evaluated several unconventional machine learning (ML) strategies on actual human tumor data. Here we show for the first time the efficacy of multiple annealing-based ML algorithms for classification of high-dimensional, multi-omics human cancer data from the Cancer Genome Atlas. To assess algorithm performance, we compared these classifiers to a variety of standard ML methods. Our results indicate the feasibility of using annealing-based ML to provide competitive classification of human cancer types and associated molecular subtypes and superior performance with smaller training datasets, thus providing compelling empirical evidence for the potential future application of unconventional computing architectures in the biomedical sciences

    Cholesterol-Independent SREBP-1 Maturation Is Linked to ARF1 Inactivation

    Get PDF
    Lipogenesis requires coordinated expression of genes for fatty acid, phospholipid, and triglyceride synthesis. Transcription factors, such as SREBP-1 (Sterol regulatory element binding protein), may be activated in response to feedback mechanisms linking gene activation to levels of metabolites in the pathways. SREBPs can be regulated in response to membrane cholesterol and we also found that low levels of phosphatidylcholine (a methylated phospholipid) led to SBP-1/SREBP-1 maturation in C. elegans or mammalian models. To identify additional regulatory components, we performed a targeted RNAi screen in C. elegans, finding that both lpin-1/Lipin 1 (which converts phosphatidic acid to diacylglycerol) and arf-1.2/ARF1 (a GTPase regulating Golgi function) were important for low-PC activation of SBP-1/SREBP-1. Mechanistically linking the major hits of our screen, we find that limiting PC synthesis or LPIN1 knockdown in mammalian cells reduces the levels of active GTP-bound ARF1. Thus, changes in distinct lipid ratios may converge on ARF1 to increase SBP-1/SREBP-1 activity

    Endothelial Mitogen-Activated Protein Kinase Kinase Kinase Kinase 4 Is Critical for Lymphatic Vascular Development and Function

    Get PDF
    The molecular mechanisms underlying lymphatic vascular development and function are not well understood. Recent studies have suggested a role for endothelial cell (EC) mitogen-activated protein kinase kinase kinase kinase 4 (Map4k4) in developmental angiogenesis and atherosclerosis. Here, we show that constitutive loss of EC Map4k4 in mice causes postnatal lethality due to chylothorax, suggesting that Map4k4 is required for normal lymphatic vascular function. Mice constitutively lacking EC Map4k4 displayed dilated lymphatic capillaries, insufficient lymphatic valves, and impaired lymphatic flow; furthermore, primary ECs derived from these animals displayed enhanced proliferation compared with controls. Yeast 2-hybrid analyses identified the Ras GTPase-activating protein Rasa1, a known regulator of lymphatic development and lymphatic endothelial cell fate, as a direct interacting partner for Map4k4. Map4k4 silencing in ECs enhanced basal Ras and extracellular signal-regulated kinase (Erk) activities, and primary ECs lacking Map4k4 displayed enhanced lymphatic EC marker expression. Taken together, these results reveal that EC Map4k4 is critical for lymphatic vascular development by regulating EC quiescence and lymphatic EC fate

    Genome Evolution and Innovation across the Four Major Lineages of Cryptococcus gattii

    Get PDF
    We acknowledge the Broad Institute Sequencing Platform and Imperial College London for generating the DNA sequence described here (and R265 Illumina sequences described previously [4]). We thank Sinéad Chapman for coordinating sequencing at the Broad Institute and Margaret Priest for assistance in submitting assemblies to NCBI. This project was supported by the National Human Genome Research Institute, grant no. U54HG003067. R.A.F. is supported by the Wellcome Trust. R.C.M. is supported by the Lister Institute for Preventive Medicine, the Medical Research Council UK, and the European Research Council.Peer reviewedPublisher PD

    Ligand-activated BMP signaling inhibits cell differentiation and death to promote melanoma

    Get PDF
    Oncogenomic studies indicate that copy number variation (CNV) alters genes involved in tumor progression; however, identification of specific driver genes affected by CNV has been difficult, as these rearrangements are often contained in large chromosomal intervals among several bystander genes. Here, we addressed this problem and identified a CNV-targeted oncogene by performing comparative oncogenomics of human and zebrafish melanomas. We determined that the gene encoding growth differentiation factor 6 (GDF6), which is the ligand for the BMP family, is recurrently amplified and transcriptionally upregulated in melanoma. GDF6-induced BMP signaling maintained a trunk neural crest gene signature in melanomas. Additionally, GDF6 repressed the melanocyte differentiation gene MITF and the proapoptotic factor SOX9, thereby preventing differentiation, inhibiting cell death, and promoting tumor growth. GDF6 was specifically expressed in melanomas but not melanocytes. Moreover, GDF6 expression levels in melanomas were inversely correlated with patient survival. Our study has identified a fundamental role for GDF6 and BMP signaling in governing an embryonic cell gene signature to promote melanoma progression, thus providing potential opportunities for targeted therapy to treat GDF6-positive cancers

    Structure of the germline genome of Tetrahymena thermophila and relationship to the massively rearranged somatic genome

    Get PDF
    The germline genome of the binucleated ciliate Tetrahymena thermophila undergoes programmed chromosome breakage and massive DNA elimination to generate the somatic genome. Here, we present a complete sequence assembly of the germline genome and analyze multiple features of its structure and its relationship to the somatic genome, shedding light on the mechanisms of genome rearrangement as well as the evolutionary history of this remarkable germline/soma differentiation. Our results strengthen the notion that a complex, dynamic, and ongoing interplay between mobile DNA elements and the host genome have shaped Tetrahymena chromosome structure, locally and globally. Non-standard outcomes of rearrangement events, including the generation of short-lived somatic chromosomes and excision of DNA interrupting protein-coding regions, may represent novel forms of developmental gene regulation. We also compare Tetrahymenas germline/soma differentiation to that of other characterized ciliates, illustrating the wide diversity of adaptations that have occurred within this phylum.</p

    The Dynamic Genome and Transcriptome of the Human Fungal Pathogen Blastomyces and Close Relative Emmonsia

    Get PDF
    Three closely related thermally dimorphic pathogens are causal agents of major fungal diseases affecting humans in the Americas: blastomycosis, histoplasmosis and paracoccidioidomycosis. Here we report the genome sequence and analysis of four strains of the etiological agent of blastomycosis, Blastomyces, and two species of the related genus Emmonsia, typically pathogens of small mammals. Compared to related species, Blastomyces genomes are highly expanded, with long, often sharply demarcated tracts of low GC-content sequence. These GC-poor isochore-like regions are enriched for gypsy elements, are variable in total size between isolates, and are least expanded in the avirulent B. dermatitidis strain ER-3 as compared with the virulent B. gilchristii strain SLH14081. The lack of similar regions in related species suggests these isochore-like regions originated recently in the ancestor of the Blastomyces lineage. While gene content is highly conserved between Blastomyces and related fungi, we identified changes in copy number of genes potentially involved in host interaction, including proteases and characterized antigens. In addition, we studied gene expression changes of B. dermatitidis during the interaction of the infectious yeast form with macrophages and in a mouse model. Both experiments highlight a strong antioxidant defense response in Blastomyces, and upregulation of dioxygenases in vivo suggests that dioxide produced by antioxidants may be further utilized for amino acid metabolism. We identify a number of functional categories upregulated exclusively in vivo, such as secreted proteins, zinc acquisition proteins, and cysteine and tryptophan metabolism, which may include critical virulence factors missed before in in vitro studies. Across the dimorphic fungi, loss of certain zinc acquisition genes and differences in amino acid metabolism suggest unique adaptations of Blastomyces to its host environment. These results reveal the dynamics of genome evolution and of factors contributing to virulence in Blastomyces.Author SummaryDimorphic fungal pathogens including Blastomyces are the cause of major fungal diseases in North and South America. The genus Emmonsia includes species infecting small mammals as well as a newly emerging pathogenic species recently reported in HIV-positive patients in South Africa. Here, we synthesize both genome sequencing of four isolates of Blastomyces and two species of Emmonsia as well as deep sequencing of Blastomyces RNA to draw major new insights into the evolution of this group and the pathogen response to infection. We investigate the trajectory of genome evolution of this group, characterizing the phylogenetic relationships of these species, a remarkable genome expansion that formed large isochore-like regions of low GC content in Blastomyces, and variation of gene content, related to host interaction, among the dimorphic fungal pathogens. Using RNA-Seq, we profile the response of Blastomyces to macrophage and mouse pulmonary infection, identifying key pathways and novel virulence factors. The identification of key fungal genes involved in adaptation to the host suggests targets for further study and therapeutic intervention in Blastomyces and related dimorphic fungal pathogens

    Whole Genome Deep Sequencing of HIV-1 Reveals the Impact of Early Minor Variants Upon Immune Recognition During Acute Infection

    Get PDF
    Deep sequencing technologies have the potential to transform the study of highly variable viral pathogens by providing a rapid and cost-effective approach to sensitively characterize rapidly evolving viral quasispecies. Here, we report on a high-throughput whole HIV-1 genome deep sequencing platform that combines 454 pyrosequencing with novel assembly and variant detection algorithms. In one subject we combined these genetic data with detailed immunological analyses to comprehensively evaluate viral evolution and immune escape during the acute phase of HIV-1 infection. The majority of early, low frequency mutations represented viral adaptation to host CD8+ T cell responses, evidence of strong immune selection pressure occurring during the early decline from peak viremia. CD8+ T cell responses capable of recognizing these low frequency escape variants coincided with the selection and evolution of more effective secondary HLA-anchor escape mutations. Frequent, and in some cases rapid, reversion of transmitted mutations was also observed across the viral genome. When located within restricted CD8 epitopes these low frequency reverting mutations were sufficient to prime de novo responses to these epitopes, again illustrating the capacity of the immune response to recognize and respond to low frequency variants. More importantly, rapid viral escape from the most immunodominant CD8+ T cell responses coincided with plateauing of the initial viral load decline in this subject, suggestive of a potential link between maintenance of effective, dominant CD8 responses and the degree of early viremia reduction. We conclude that the early control of HIV-1 replication by immunodominant CD8+ T cell responses may be substantially influenced by rapid, low frequency viral adaptations not detected by conventional sequencing approaches, which warrants further investigation. These data support the critical need for vaccine-induced CD8+ T cell responses to target more highly constrained regions of the virus in order to ensure the maintenance of immunodominant CD8 responses and the sustained decline of early viremia

    Analysis of the Genome and Transcriptome of Cryptococcus neoformans var. grubii Reveals Complex RNA Expression and Microevolution Leading to Virulence Attenuation

    Get PDF
    Cryptococcus neoformans is a pathogenic basidiomycetous yeast responsible for more than 600,000 deaths each year. It occurs as two serotypes (A and D) representing two varieties (i.e. grubii and neoformans, respectively). Here, we sequenced the genome and performed an RNA-Seq-based analysis of the C. neoformans var. grubii transcriptome structure. We determined the chromosomal locations, analyzed the sequence/structural features of the centromeres, and identified origins of replication. The genome was annotated based on automated and manual curation. More than 40,000 introns populating more than 99% of the expressed genes were identified. Although most of these introns are located in the coding DNA sequences (CDS), over 2,000 introns in the untranslated regions (UTRs) were also identified. Poly(A)-containing reads were employed to locate the polyadenylation sites of more than 80% of the genes. Examination of the sequences around these sites revealed a new poly(A)-site-associated motif (AUGHAH). In addition, 1,197 miscRNAs were identified. These miscRNAs can be spliced and/or polyadenylated, but do not appear to have obvious coding capacities. Finally, this genome sequence enabled a comparative analysis of strain H99 variants obtained after laboratory passage. The spectrum of mutations identified provides insights into the genetics underlying the micro-evolution of a laboratory strain, and identifies mutations involved in stress responses, mating efficiency, and virulence
    • …
    corecore